


SEQUENCE LISTING 

<110> Lukyanov, Sergey 

<120> FLUORESCENT PROTEINS FROM COPEPODA SPECIES AND METHODS FOR USING 
<130> COP_TEXT.DOC
<160> 30 

<170> PatentIn version 3.1

<210> 1
<211> 1010 
<212> DNA 

<213> Pontellina plumata 
<400> 1 



agtctgctct ccaaaggata gacagtaaca ccaccaatat gcctgccatg aagattgagt 60

gccgcatcag tggaaccctg aacggagtgg tgtttgagct ggtcggaggt ggaoaaqaga 120

ctcctgagca gggacgratg accaacaaga tgaagtctac caagggcgcc rtgacctrct 180

ccccccacct tctctctcai g'ccargggat acgggttcta ccactttggg acctaiccca 240

gtgggtatga gaatcccrtc ctgcatgccg ccaacaacgg ggggracacc aacaccagga 300

1;tgagaagta tgaggatgga ggagttcttc atgttagctt cagctacaga tatgaagcag 360

gcagggttat tggggatttc aaggttgtcg ggacaggatt ccctgaggac agtgtgatct 420

tcaccgacaa gatcatccgg tccaatgcta ccgtggagca cttgcaccca atgggagaca 480

acgttcttgt gggctccttc gcgagaacct tttccctgag ggatggaggc tactactcat 540

ttgtggttga cagccacatg cacttcaaga gtgccatcca cccatccatc ctccagaacg 600

gggggtccat gtttgccttc aggagagttg aggaacttca ctccaacact gaacttggca 660

ttgtagagta tcaacargcc ttcaagactc ccacagcatt tgcctgaact agaaagtatc 720

aaatataaac agagtgacaa aggatctgtc gtcattctaa actttgtarg atttacaaat 780

aatgatti;$a tggcaactcc caaaatagac ttgaattaat tgaaaaatca accaaacara 840

atccttgttg ctctgttgat atgaacgctt tctgacttgg accccggctt gaactgaccc 900

tgaaccacat cagacgaata acttgattct aaaattatat gaattttcaa acaaaacaat 960

ataatttgtt aatgtgtaat catcttgaat aaacatatca gagaac-ccac 1010



<210> 2 

<211> 222 

<212> PRT 

<213> Pontellina plumata 

<400> 2 



Met Pro Ala Met Lys Ile Glu Cys Arg Ile Ser Gly Thr Leu Asn Gly
1 5 10 15

Val Val Phe Glu Leu Val Gly Gly Gly Glu Gly Ile Pro Glu Gln Gly
20 25 30

Arg Met Thr Asn Lys Met Lys Ser Thr Lys Gly Ala Leu Thr Phe Ser
35 40 45

Pro Tyr Leu Leu Ser His Val Met Gly Tyr Gly Phe Tyr His Phe Gly
50 55 60

Thr Tyr Pro Ser Gly Tyr Glu Asn Pro Phe Leu His Ala Ala Asn Asn
65 70 75 80

Gly Gly Tyr Thr Asn Thr Arg Ile Glu Lys Tyr Glu Asp Gly Gly Val
85 90 95

Leu His Val Ser Phe Ser Tyr Arg Tyr Glu Ala Gly Arg Val Ile Gly
100 105 110

Asp Phe Lys Val Val Gly Thr Gly Phe Pro Glu Asp Ser Val Ile Phe
115 120 125

Thr Asp Lys Ile Ile Arg Ser Asn Ala Thr Val Glu His Leu His Pro
130 135 140

Met Gly Asp Asn Val Leu Val Gly Ser Phe Ala Arg Thr Phe Ser Leu
145 150 155 160

Arg Asp Gly Gly Tyr Tyr Ser Phe Val Val Asp Ser His Met His Phe
165 170 175

Lys Ser Ala Ile His Pro Ser Ile Leu Gln Asn Gly Gly Ser Met Phe
180 185 190

Ala Phe Arg Arg Val Glu Glu Leu His Ser Asn Thr Glu Leu Gly Ile
195 200 205

Val Glu Tyr Gln His Ala Phe Lys Thr Pro Thr Ala Phe Ala
210 215 220



<210> 3 
<211> 1010 
<212> DNA 

<213> Pontellina plumata 
<400> 3 

agtctgctct ccaaaggata gacagtaaca ccaccaatat gcctgccatg aagairtgagt 60

gccgcatcac gggaaccctg aacggagtgg agtttgagct ggtcggaggt ggagaaggga 120

ctcctgagoa gggacgtatg accaacaaga tgaagtctac caagggcgcc ttgaccttct 180

coccctacct tctctctcat gtcatgggat acgggttcta ccactttggt acctatccca 240

gtgggtatga gaattccctrc ctgcatgcca tcaacaacgg ggggtacacc aaoaccagga 300

ttgagaagta tgaggatgga ggagttct'cc atgttagctt tagctacaga tabgaagcag 360

gcagggtgat tggggatttc aaggttgtcg ggacaggatt ccctgaggac agtgtgatct 420

tcaocgacaa gatcatccgg tccaatgcta ccgtggagca cttgcaccca atgggagaca 480

acgttcttgt gggctccttc gcgagaacct tttccctgag ggatggaggc tactactcat 540

ttgtggttga cagccacatg cacttcaaga gtgccatcca cccatccatc ctccagaacg 600

gggggcccat gtttgccttc aggagagttg aggaacttca ctccaacact gaacttggca 660

ttgtagagta tcaacatgcc ttcaagactc ccatagcatt tgcttaaact acaaagiatc 720

aaatatt^ac agattgacaa aggatatgtc gtcattctaa actttgtatg attracaaat 780

aatgatrtaa rgrcaaccct caaaataggc ttgaattaat tgaaaaatca actaaacata 840

atccttgrtg c'cctgt-cgat atgaacactt tctgacttgg accccggcct gaactgaccc 900

tgacccacat cagacgaaga acttgattct aagattatat gaattttcaa aaaaaacaat 960

argatttgtt aargtgtaat catcttgaat aaacatarca gagaacgcac 1010


<210> 4

<211> 222

<212> PRT

<213> Pontellina plumata

<400> 4
















Met Pro Ala Met Lys Ile Glu Cys Arg Ile Thr Gly Thr Leu Asn Gly
1 5 10 15

Val Glu Phe Glu Leu Val Gly Gly Gly Glu Gly Thr Pro Glu Gln Gly
20 25 30

Arg Met Thr Asn Lys Met Lys Ser Thr Lys Gly Ala Leu Thr Phe Ser
35 40 45

Pro Tyr Leu Leu Ser His Val Met Gly Tyr Gly Phe Tyr His Phe Gly
50 55 60

Thr Tyr Pro Ser Gly Tyr Glu Asn Pro Phe Leu His Ala Ile Asn Asn
65 70 75 80

Gly Gly Tyr Thr Asn Thr Arg Ile Glu Lys Tyr Glu Asp Gly Gly Val
85 90 95

Leu His Val Ser Phe Ser Tyr Arg Tyr Glu Ala Gly Arg Val Ile Gly
100 105 110

Asp Phe Lys Val Val Gly Thr Gly Phe Pro Glu Asp Ser Val Ile Phe
115 120 125

Thr Asp Lys Ile Ile Arg Ser Asn Ala Thr Val Glu His Leu His Pro
130 135 140

Met Gly Asp Asn Val Leu Val Gly Ser Phe Ala Arg Thr Phe Ser Leu
145 150 155 160

Arg Asp Gly Gly Tyr Tyr Ser Phe Val Val Asp Ser His Met His Phe
165 170 175

Lys Ser Ala Ile His Pro Ser Ile Leu Gln Asn Gly Gly Pro Met Phe
180 185 190

Ala Phe Arg Arg Val Glu Glu Leu His Ser Asn Thr Glu Leu Gly Ile
195 200 205

Val Glu Tyr Gln His Ala Phe Lys Thr Pro Ile Ala Phe Ala
210 215 220

<210> 5 

<211> 814 

<212> DNA 

<213> Labidocera aestiva 

<400> 5 



cagtttctcc caagctaaar aaagaaacac atcaaaagca tcaacatgcc tgtcaigaag 60

attgagrgcc gtatcrctgg aaccatgaac ggagaggagr t;"cgagctr.gr aggagctggc 120

ganggaaaca ctgatgaagg acgtatgacc aacaagar,ga agtccaccaa aggacctctc 180

tccttctcic cctacciact ctcccacarc atgggctacg gattctatca ctatgctacc 240

ttccctgcrg gatatgagaa tgtctacctc catgetgeta agaatggagg ctacaccaac 300

accaggacrg agaggisacga agacggagga arcatttegg tcaacttcac ctacagacat 360

gagggaaaca aggttatcgg agacrtcaag grtgttggat caggattccc agcraacagt 420

gctatctrca ctgacaagar carcaagtcc aacccaacct gtgagcacat ctaccccaag 480

ggagaraata ttctigncaa tgcctacact cgaacttgga rgctgagaga tggt.ggac.ac 540

tactctgcac aggtcaacaa tcatctccac ttcaagactg ccatgcatcc caccatgctc 600

cagaacggag garocatgnr tacctacagg aaggttgagg agcnccacag ccagtcagat 660

gttggtattg tagaatacca acatgtcttc aagaccccaa ctgetttrge ctaagcttgg 720

aaatatggtt cctatcagac aattaaraca ataaactrta cttatcattg raaaaccaaa 780

crcr-cc-caat gaataaattt ccgtatctac ract 814



<210> 6 
<211> 222 
<212> PRT 

<213> Labidocera aestiva 
<400> 6 

Met Pro Val Met Lys Ile Glu Cys Arg Ile Ser Gly Thr Met Asn Gly
1 5 10 15

Glu Glu Phe Glu Leu Val Gly Ala Gly Asp Gly Asn Thr Asp Glu Gly
20 25 30

Arg Met Thr Asn Lys Met Lys Ser Thr Lys Gly Pro Leu Ser Phe Ser
35 40 45

Pro Tyr Leu Leu Ser His Ile Met Gly Tyr Gly Phe Tyr His Tyr Ala
50 55 60

Thr Phe Pro Ala Gly Tyr Glu Asn Val Tyr Leu His Ala Ala Lys Asn
65 70 75 80

Gly Gly Tyr Thr Asn Thr Arg Thr Glu Arg Tyr Glu Asp Gly Gly Ile
85 90 95

Ile Ser Val Asn Phe Thr Tyr Arg Tyr Glu Gly Asn Lys Val Ile Gly
100 105 110

Asp Phe Lys Val Val Gly Ser Gly Phe Pro Ala Asn Ser Val Ile Phe
115 120 125

Thr Asp Lys Ile Ile Lys Ser Asn Pro Thr Cys Glu His Ile Tyr Pro
130 135 140

Lys Gly Asp Asn Ile Leu Val Asn Ala Tyr Thr Arg Thr Trp Met Leu
145 150 155 160

Arg Asp Gly Gly Tyr Tyr Ser Ala Gln Val Asn Asn His Leu His Phe
165 170 175

Lys Thr Ala Met His Pro Thr Met Leu Gln Asn Gly Gly Ser Met Phe
180 185 190

Thr Tyr Arg Lys Val Glu Glu Leu His Ser Gln Ser Asp Val Gly Ile
195 200 205

Val Glu Tyr Gln His Val Phe Lys Thr Pro Thr Ala Phe Ala
210 215 220



<210> 7 
<211> 753 
<212> DNA 

<213> cf. Pontella meadi Wheeler
<400> 7 

atcagtxcat cagtacacga gcagagtcac acatcaaaar gcctgacatg aagcttgagt 60 

gccacatctc cggaaccatg aatggagagg agtttgaacr ratrggngcr ggagatggaa 120 

aracagarga gggacgcatg accaacaaaa tgaagtccat caaaggacct atcrccttct 180 

crccctacct cctctcccac attcttggct acggatatta ccactttgca accrtccctg 240

ctggatatga aaatatctac cttcatgcca igaagaatgg aggrractcc aatgtcagaa 300 

crgagaggta tgaggatgga ggcatcattt ctataacctt caacracaga tatgaaggga 360 

acaagatcat rggagacttc aaggttgttg gaacaggatt ccctaccaac agrcrratct 420

rcacrgacaa gargattaaa tccaacccta cctgtgagaa catgncccc aaggctgaca 480 

atacxcttgt gaatgcctac accagaacat atttgcttaa agatggtgga tactactctg 540

cccaggrtaa caaccatatg cacttcaaga gtgccatcca raccaccatg ctccagaatg 600 

gcggaiccat gticacctac agagttgrag aggagacaca cactcagaac gaagttgcta 660 

rtgtagagta ccaaaaigtc iccaaaaccc caactgcgtt rgctrgaaar acct.gtaata 720 

aaactgcaaa gaaataaact aaattgtaca ate 753 

<210> 8
<211> 222
<212> PRT

<213> cf. Pontella meadi Wheeler
<400> 8



Met Pro Asp Met Lys Leu Glu Cys His Ile Ser Gly Thr Met Asn Gly
1 5 10 15

Glu Glu Phe Glu Leu Ile Gly Ala Gly Asp Gly Asn Thr Asp Glu Gly
20 25 30

Arg Met Thr Asn Lys Met Lys Ser Ile Lys Gly Pro Ile Ser Phe Ser
35 40 45

Pro Tyr Leu Leu Ser His Ile Leu Gly Tyr Gly Tyr Tyr His Phe Ala
50 55 60

Thr Phe Pro Ala Gly Tyr Glu Asn Ile Tyr Leu His Ala Met Lys Asn
65 70 75 80

Gly Gly Tyr Ser Asn Val Arg Thr Glu Arg Tyr Glu Asp Gly Gly Ile
85 90 95

Ile Ser Ile Thr Phe Asn Tyr Arg Tyr Glu Gly Asn Lys Ile Ile Gly
100 105 110

Asp Phe Lys Val Val Gly Thr Gly Phe Pro Thr Asn Ser Leu Ile Phe
115 120 125

Thr Asp Lys Ile Ile Lys Ser Asn Pro Thr Cys Glu Asn Met Phe Pro
130 135 140

Lys Ala Asp Asn Thr Leu Val Asn Ala Tyr Thr Arg Thr Tyr Leu Leu
145 150 155 160

Lys Asp Gly Gly Tyr Tyr Ser Ala Gln Val Asn Asn His Met His Phe
165 170 175

Lys Ser Ala Ile His Thr Thr Met Leu Gln Asn Gly Gly Ser Met Phe
180 185 190

Thr Tyr Arg Val Val Glu Glu Thr His Thr Gln Asn Glu Val Ala Ile
195 200 205

Val Glu Tyr Gln Asn Val Phe Lys Thr Pro Thr Ala Phe Ala
210 215 220



<210> 9 

<211> 880 

<212> DNA 

<213> cf. Pontella meadi Wheeler

<400> 9 



tcctgigtcc cagtcatcac cgggccctgt gaggaggaag agcacacaga caggagagta 60

taaatacaga gcggaagcac ggtgarcatc agttcctcag taaacgagta gagacacaca 120

tcaaaargcc tgacatgaag cttgag-cgcc acatciccgg aaccatgaat ggagaggagt 180

ttgaactrai tggttctgga gatggaaata crgatcaggg acgcatgaca aacaaratga 240

agtccaicaa aggaccrcrc rccbtctcrc cctacctact ctcccacatt cttggctatg 300

gatattacca ctttgcaacc ttccctgccg gatatgaaaa ratctacctr cargccarga 360

agaatggagg ttactcaaat gtcaggactg agaggtarga ggatggaggc atcatrtcta 420

taaccttcaa cracagatat gaaggcagca agatcatrgg agacttcaaa gttatrggaa 480

caggattccc taccgacagt citatcrtca ctgacaagat camaaatcc aaccctacct 540






rrrf nacaaca 

U L U f— L > — C-Lt-L v*. w-t 






ci y a a. lq l v 


600 


tgcttaaaga rggtggatac tactctgccc aggtraacaa ccataigcac rtcaagagrg 660

ccatccatcc tacaatgctc cagaarggtg garccargtt cactcacaga gragragagg 720

agaaccacac taagaccaac grtgctatcg tagagtacca aaatgtcttc aaaactccta 780

crgcarttgc tcaaaatacr rgtaacaaaa ctgcaaagaa araaccrata ttgtacaata 840

gcattrratt aargcataga aaaataaarg ratattttar 880



<210> 10 
<211> 222 
<212> PRT 

<213> cf. Pontella meadi Wheeler
<400> 10

Met Pro Asp Met Lys Leu Glu Cys His Ile Ser Gly Thr Met Asn Gly
1 5 10 15

Glu Glu Phe Glu Leu Ile Gly Ser Gly Asp Gly Asn Thr Asp Gln Gly
20 25 30

Arg Met Thr Asn Asn Met Lys Ser Ile Lys Gly Pro Leu Ser Phe Ser
35 40 45

Pro Tyr Leu Leu Ser His Ile Leu Gly Tyr Gly Tyr Tyr His Phe Ala
50 55 60

Thr Phe Pro Ala Gly Tyr Glu Asn Ile Tyr Leu His Ala Met Lys Asn
65 70 75 80

Gly Gly Tyr Ser Asn Val Arg Thr Glu Arg Tyr Glu Asp Gly Gly Ile
85 90 95

Ile Ser Ile Thr Phe Asn Tyr Arg Tyr Glu Gly Ser Lys Ile Ile Gly
100 105 110

Asp Phe Lys Val Ile Gly Thr Gly Phe Pro Thr Asp Ser Leu Ile Phe
115 120 125

Thr Asp Lys Ile Ile Lys Ser Asn Pro Thr Cys Glu Asn Met Phe Pro
130 135 140

Lys Ala Asp Asn Ile Leu Val Asn Ala Tyr Thr Arg Thr Tyr Leu Leu
145 150 155 160

Lys Asp Gly Gly Tyr Tyr Ser Ala Gln Val Asn Asn His Met His Phe
165 170 175

Lys Ser Ala Ile His Pro Thr Met Leu Gln Asn Gly Gly Ser Met Phe
180 185 190

Thr His Arg Val Val Glu Glu Asn His Thr Lys Thr Asn Val Ala Ile
195 200 205

Val Glu Tyr Gln Asn Val Phe Lys Thr Pro Thr Ala Phe Ala
210 215 220

<210> 11 

<211> 847 

<212> DNA 

<213> Pontella mediterranea

<400> 11 



agcagtggta tcaacgcaga g-cacgcggga grtcctcaac gaaaaccaag agaaacagac 60

atcaagatgc ccaacatgaa gettgagtgc cgtarctccg gaaccatgaa tggagaggag 120

rttgaacttg XLzgqzqctgg agaaggaaac accgargagg gaegcatgae caacaagatg 180

aagtccacca agggacctct ttccttctct ecttatttge tctcccacgt tcttggtrat 240

ggatactacc acrargcxac cttccctgct ggatatgaaa argrctaccr ccatgccarg 300

aagaa'tggag gttactccaa cacaagaact gagaggtatg aggatggagg tatcatttct 360

gctaccrtca actacagata tgaagggaga caga^catg gagacttcaa ggrrgragga 420

acgggattcc ctgccgacag caccarcitc actgacaaga tcarcaagrc caacccracc 480

tgtgagcaca tctaccccaa ggctaacaat arrcttgtga atgettacac cagaaccrgg 540

atgcttagag arggrggara etactctgec caggtcaaca accacatgca tttacagagt 600

gccattcatc ccaccat-gct caagaatggt ggatccatgt tcacctacag aaaggttgag 660

gagctccaca cacaaactga agteggratt gttgaatacc agcatgtcct caagaggeca 720

actgcttttg ctraattfcg taaataaaga aagaatctat aatgeaatag caccttaaag 780

t-cctcaggat aataaatara taaagacttt taataaaaaa aaaaaaaaaa aaaaaaaaaa 840

aaaaaaa 847



<210> 12 
<211> 222 
<212> PRT 

<213> Pontella mediterranea
<400> 12

Met Pro Asn Met Lys Leu Glu Cys Arg Ile Ser Gly Thr Met Asn Gly
1 5 10 15

Glu Glu Phe Glu Leu Val Gly Ala Gly Glu Gly Asn Thr Asp Glu Gly
20 25 30

Arg Met Thr Asn Lys Met Lys Ser Thr Lys Gly Pro Leu Ser Phe Ser
35 40 45

Pro Tyr Leu Leu Ser His Val Leu Gly Tyr Gly Tyr Tyr His Tyr Ala
50 55 60

Thr Phe Pro Ala Gly Tyr Glu Asn Val Tyr Leu His Ala Met Lys Asn
65 70 75 80

Gly Gly Tyr Ser Asn Thr Arg Thr Glu Arg Tyr Glu Asp Gly Gly Ile
85 90 95

Ile Ser Ala Thr Phe Asn Tyr Arg Tyr Glu Gly Arg Gln Ile His Gly
100 105 110

Asp Phe Lys Val Val Gly Thr Gly Phe Pro Ala Asp Ser Ile Ile Phe
115 120 125

Thr Asp Lys Ile Ile Lys Ser Asn Pro Thr Cys Glu His Ile Tyr Pro
130 135 140

Lys Ala Asn Asn Ile Leu Val Asn Ala Tyr Thr Arg Thr Trp Met Leu
145 150 155 160

Arg Asp Gly Gly Tyr Tyr Ser Ala Gln Val Asn Asn His Met His Leu
165 170 175

Gln Ser Ala Ile His Pro Thr Met Leu Lys Asn Gly Gly Ser Met Phe
180 185 190

Thr Tyr Arg Lys Val Glu Glu Leu His Thr Gln Thr Glu Val Gly Ile
195 200 205

Val Glu Tyr Gln His Val Phe Lys Arg Pro Thr Ala Phe Ala
210 215 220



<210> 13 
<211> 850 
<212> DNA 

<213> Pontella mediterranea
<400> 13

gcagtggtat caacgcagag tacgcgggga gttcctcaac gaaaaccgag agaaacatac 60

atcaaaatgc cccacatgaa gcttgagtgc cgtatcrccg gaaccatgaa cggagaggag 120

tttgaacttg ttggtgctgg agatggaaac actgatgagg gacgcatgac caaccagatg 180

aagtccacaa agggacctct crccttctcc ccctacttgc tctcccacgt tcttggctat 240

ggataccacc actatgcrac ctrccctgct gga^atgaaa atgtcn&cc-c ccargccatg 300

aagaarggag gttactccaa cacaagaact gagaggtatg acgatggagg tatcatttct 360

gctaccttca actacagata tgaagggaga cagattcatg gagactrcaa ggttgttgga 420

acrggacrcc ctgccgacag catcatcttc actgacaaga tcatcaagtc caaccctacc 480

tgtgagcaca cctaccccaa ggctgacaar ar.r,cttgtga atgcctacac cagaaccrgg 540

argcttagag arggtggata ctacrctgct caggrcaaca accacargca crtraagagt 600

gccatccatc ccaccatgct ccagaarggt ggatcratgt tcacctacag aaaggttgag 660

gagctccaca cacaaactga agttggtatt gttgagtacc agcatgt^rr caagaggccc 720

acagcttttg cttaarrttg taaataaaga aagaatttat aatacaatag tgcttrrarg 780

tctciaaaac aatgaatgta raaataaatc tcaaaatatt caaaaaaaaa aaaaaaaaaa 840

aaaaaaaaaa 850

<210> 14
<211> 222
<212> PRT

<213> Pontella mediterranea
<400> 14

Met Pro His Met Lys Leu Glu Cys Arg Ile Ser Gly Thr Met Asn Gly
1 5 10 15

Glu Glu Phe Glu Leu Val Gly Ala Gly Asp Gly Asn Thr Asp Glu Gly
20 25 30

Arg Met Thr Asn Gln Met Lys Ser Thr Lys Gly Pro Leu Ser Phe Ser
35 40 45

Pro Tyr Leu Leu Ser His Val Leu Gly Tyr Gly Tyr Tyr His Tyr Ala
50 55 60

Thr Phe Pro Ala Gly Tyr Glu Asn Val Tyr Leu His Ala Met Lys Asn
65 70 75 80

Gly Gly Tyr Ser Asn Thr Arg Thr Glu Arg Tyr Asp Asp Gly Gly Ile
85 90 95

Ile Ser Ala Thr Phe Asn Tyr Arg Tyr Glu Gly Arg Gln Ile His Gly
100 105 110

Asp Phe Lys Val Val Gly Thr Gly Phe Pro Ala Asp Ser Ile Ile Phe
115 120 125

Thr Asp Lys Ile Ile Lys Ser Asn Pro Thr Cys Glu His Ile Tyr Pro
130 135 140

Lys Ala Asp Asn Ile Leu Val Asn Ala Tyr Thr Arg Thr Trp Met Leu
145 150 155 160

Arg Asp Gly Gly Tyr Tyr Ser Ala Gln Val Asn Asn His Met His Phe
165 170 175

Lys Ser Ala Ile His Pro Thr Met Leu Gln Asn Gly Gly Ser Met Phe
180 185 190

Thr Tyr Arg Lys Val Glu Glu Leu His Thr Gln Thr Glu Val Gly Ile
195 200 205

Val Glu Tyr Gln His Val Phe Lys Arg Pro Thr Ala Phe Ala
210 215 220



<210> 15

<211> 821 

<212> DNA 

<213> Unknown 
<220> 

<223> nucleic acid sequence for pdaelGFP from an unidentified 

Pontellidae species, complete cds 

<400> 15 



atc.agt.ttaa cttctttcag aagacaacta agacctacca acatggcagc catgaagatt 60

gagtgcagga rcactggaac catgaacgga gtggagtttg agctggtrgg aggaggagaa 120

ggaaatactg arcagggacg tatgaccaac aagatgaaat ctaccaaggg tccacrcrcc 180

ttctctccct atcttctctc tcatgrcatg ggatatggat tctatcattt tggaacattt 240

cccagtggtt atgagaatcc ctatgtccac gccatgacga acggtggata taccaacacc 300

aggattgaaa gttatgaaga tggaggtg-ct ctttacctta cctrcaacra cagattggat 360

ggaaacaaga rrarcgggga ctrcaagtgt grcggaactg gattccctga ggacagcgtt 420

atcxtcactg acaagatcat caagtccaac cccaattgrg aacatxtcta tccaatggct 480

gaaaacatca tgaaaaatgc ctacatgaga actctctccc tcagagatgg tggctaccac 540

tctggccagg ttaccagcca catccacttc aagaatgcga rccacccatc catccttcat 600

aacggcggat cca^gt'ccac ctacagaaga gttgaggagc rccacactca aactgatctt 660

ggaattgttg agtaccagca tgtattcaag actcccactg crtttgcttg aatgccatga 720

agatgaaacc tgaacaagat caatctttat ttaccacaat atgtaaattg rttaattgta 780

raatrctcga gaart-catat aatacataga atttatctta c 821



<210> 16 

<211> 222 

<212> PRT 

<213> Unknown 
<220> 

<223> amino acid sequence for pdaelGFP from an unidentified Pontellidae 

species 

<400> 16 



Met Ala Ala Met Lys Ile Glu Cys Arg Ile Thr Gly Thr Met Asn Gly
1 5 10 15

Val Glu Phe Glu Leu Val Gly Gly Gly Glu Gly Asn Thr Asp Gln Gly
20 25 30

Arg Met Thr Asn Lys Met Lys Ser Thr Lys Gly Pro Leu Ser Phe Ser
35 40 45

Pro Tyr Leu Leu Ser His Val Met Gly Tyr Gly Phe Tyr His Phe Gly
50 55 60

Thr Phe Pro Ser Gly Tyr Glu Asn Pro Tyr Val His Ala Met Thr Asn
65 70 75 80

Gly Gly Tyr Thr Asn Thr Arg Ile Glu Ser Tyr Glu Asp Gly Gly Val
85 90 95

Leu Tyr Leu Thr Phe Asn Tyr Arg Leu Asp Gly Asn Lys Ile Ile Gly
100 105 110

Asp Phe Lys Cys Val Gly Thr Gly Phe Pro Glu Asp Ser Val Ile Phe
115 120 125

Thr Asp Lys Ile Ile Lys Ser Asn Pro Asn Cys Glu His Phe Tyr Pro
130 135 140

Met Ala Glu Asn Ile Met Lys Asn Ala Tyr Met Arg Thr Leu Ser Leu
145 150 155 160

Arg Asp Gly Gly Tyr Tyr Ser Gly Gln Val Thr Ser His Ile His Phe
165 170 175

Lys Asn Ala Ile His Pro Ser Ile Leu His Asn Gly Gly Ser Met Phe
180 185 190

Thr Tyr Arg Arg Val Glu Glu Leu His Thr Gln Thr Asp Leu Gly Ile
195 200 205

Val Glu Tyr Gln His Val Phe Lys Thr Pro Thr Ala Phe Ala
210 215 220



<210> 17 

<211> 669 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> nucleic acid sequence for the humanized version of the ppluGFP2

<400> 17 



atgcccgcca tgaagatcga gtgccgcatc accggcaccc tgaacggcgt ggagrtcgag 60

ctggtgggcg gcggagaggg cacccccgag cagggccgca tgaccaacaa gargaagagc 120

accaagggcg ccctgaccrt cagcccctac crgctgagcc acgtgatggg ctacggcttc 180

taccacttcg gcacctaccc cagcggctac gagaacccct tcctgcacgc carcaacaac 240

ggcggctaca ccaacacccg catcgagaag tacgaggacg gcggcgtgcr gcacgtgagc 300

ttcagctacc gctacgaggc cggccgcgrg atcggcgact tcaaggrggt gggcaccggc 360

ttccocgagg acagcgtgat ctrcaccgac aagatcatcc gcagcaacgc caccgrggag 420

cacctgcacc ccatgggcga taacgtgctg gtgggcagcr tcgcccgcac cttcagcctg 480

cgcgacggcg gcractacag ctrcgrggtg gacagccaca tgcacttcaa gagcgccatc 540

caccccagca tccrgcagaa cgggggcccc atgtccgcct tccgccgcgt ggaggagctg 600

cacagcaaca ccgagctggg catcgtggag taccagcacg ccttcaagac cccgarcgca 660

trcgcctga 669
<210> 18

<211> 222 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> amino acid sequence for the humanized version of the ppluGFP2

<400> 18 



Met Pro Ala Met Lys Ile Glu Cys Arg Ile Thr Gly Thr Leu Asn Gly
1 5 10 15

Val Glu Phe Glu Leu Val Gly Gly Gly Glu Gly Thr Pro Glu Gln Gly
20 25 30

Arg Met Thr Asn Lys Met Lys Ser Thr Lys Gly Ala Leu Thr Phe Ser
35 40 45

Pro Tyr Leu Leu Ser His Val Met Gly Tyr Gly Phe Tyr His Phe Gly
50 55 60

Thr Tyr Pro Ser Gly Tyr Glu Asn Pro Phe Leu His Ala Ile Asn Asn
65 70 75 80

Gly Gly Tyr Thr Asn Thr Arg Ile Glu Lys Tyr Glu Asp Gly Gly Val
85 90 95

Leu His Val Ser Phe Ser Tyr Arg Tyr Glu Ala Gly Arg Val Ile Gly
100 105 110

Asp Phe Lys Val Val Gly Thr Gly Phe Pro Glu Asp Ser Val Ile Phe
115 120 125

Thr Asp Lys Ile Ile Arg Ser Asn Ala Thr Val Glu His Leu His Pro
130 135 140

Met Gly Asp Asn Val Leu Val Gly Ser Phe Ala Arg Thr Phe Ser Leu
145 150 155 160

Arg Asp Gly Gly Tyr Tyr Ser Phe Val Val Asp Ser His Met His Phe
165 170 175

Lys Ser Ala Ile His Pro Ser Ile Leu Gln Asn Gly Gly Pro Met Phe
180 185 190

Ala Phe Arg Arg Val Glu Glu Leu His Ser Asn Thr Glu Leu Gly Ile
195 200 205

Val Glu Tyr Gln His Ala Phe Lys Thr Pro Ile Ala Phe Ala
210 215 220



<210> 19 

<211> 589 

<212> DNA 

<213> Artificial sequence
<220>

<223> nucleic acid sequence for the ppluGFP2 with yeast-optimized codon usage
<400> 19



tactccagaa caaggcagaa tgactaaraa aatgaaaict actaaaggtg crrtgacttt 60

tcctccarat ttgttgtcrc atgrtarggg ttatggtttt tatcattctg gracttatcc 120

atctggttar gaaaatccau ttttgcatgc rattaataar ggtggttara ctaatactag 180

aactgaaaaa ratgaagatg gtggtgtttt gcargrrrct zzttcctata gatatgaagc 240

tggcagagtt attggcgatt ttaaagrtgt tggtacrggt tttccagaag attcigctat 300

trttactgat aaaartatra gatctaatgc ractgttgaa carrtgcacc caargggrga 360

raatgrtttg gttggttctt ttgcragaac tttrrcrrtg agagatggtg gt:tatrattc 420

ttrtgttgtt gattctcata tgcatrttaa atctgctatt catccatcia trttgcaaaa 480

rggtggtcca atgttrgcit ttagaagagu tgaagaatrg cattctaata ctgaattggg 540

rattgrtgaa tatcaacatg cttitaaaac tccaatngct tttgcttaa 589



<210> 20 

<211> 222 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> amino acid sequence for the ppluGFP2 with yeast-optimized codon usage

<400> 20

Met Pro Ala Met Lys Ile Glu Cys Arg Ile Thr Gly Thr Leu Asn Gly
1 5 10 15

Val Glu Phe Glu Leu Val Gly Gly Gly Glu Gly Thr Pro Glu Gln Gly
20 25 30

Arg Met Thr Asn Lys Met Lys Ser Thr Lys Gly Ala Leu Thr Phe Ser
35 40 45

Pro Tyr Leu Leu Ser His Val Met Gly Tyr Gly Phe Tyr His Phe Gly
50 55 60

Thr Tyr Pro Ser Gly Tyr Glu Asn Pro Phe Leu His Ala Ile Asn Asn
65 70 75 80

Gly Gly Tyr Thr Asn Thr Arg Ile Glu Lys Tyr Glu Asp Gly Gly Val
85 90 95

Leu His Val Ser Phe Ser Tyr Arg Tyr Glu Ala Gly Arg Val Ile Gly
100 105 110

Asp Phe Lys Val Val Gly Thr Gly Phe Pro Glu Asp Ser Val Ile Phe
115 120 125

Thr Asp Lys Ile Ile Arg Ser Asn Ala Thr Val Glu His Leu His Pro
130 135 140

Met Gly Asp Asn Val Leu Val Gly Ser Phe Ala Arg Thr Phe Ser Leu
145 150 155 160

Arg Asp Gly Gly Tyr Tyr Ser Phe Val Val Asp Ser His Met His Phe
165 170 175

Lys Ser Ala Ile His Pro Ser Ile Leu Gln Asn Gly Gly Pro Met Phe
180 185 190

Ala Phe Arg Arg Val Glu Glu Leu His Ser Asn Thr Glu Leu Gly Ile
195 200 205

Val Glu Tyr Gln His Ala Phe Lys Thr Pro Ile Ala Phe Ala
210 215 220

<210> 21 
<211> 669
<212> DNA 

<213> Artificial sequence 
<220> 

<223> nucleic acid sequence for the CopCFP mutant
<400> 21

atgcccgcca tgaagatcga gtgccgcatc accggcaccc tgaacggcgt ggagttcgag 60 
ccggtgggcg gcggagaggg cacccccgag cagggccgca tgaccaacaa gatgaagagc 120 
accaaaggcg ccctgacctt cagcccctac ctgctgagcc acgtgarggg ctggggcttt 180 
taccacttcg gcscctaccc cagcggctac gagaacccct tcctgcacgc catcaacaac 240 
ggcggccaca ccaacacccg catcgagaag tacgaggacg gcggcgtgcr gcacgtgagc 300
ttcagctacc gctacgaggc cggccgcgtg atcggcgact tcaaggtggt gggcaccggc 360
ttccccgagg acagcgtgat cttcaccgac aagatcatcc gcagcaacgc caccgrggag 420
cacccgcgcc ccatgggcga taacgtgctg gtgggcagct tcgcccgcac cttcagcctg 480
cgcgacggcg gctactacag cttcgtggtg gacagccaca tgcacttcaa gagcgccatc 540
caccccagca tcctgcagaa cgggggcccc atgttcgcct tccgccgcgt ggaggagctg 600
cacagcaaca ccgagcrggg catcgtggag taccagcacg cctrcaagac cccgaccgca 660 
ttcgcctaa 669

<210> 22 

<211> 222 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> amino acid sequence for the CopCFP mutant 

<400> 22 






Met Pro Ala Met Lys Ile Glu Cys Arg Ile Thr Gly Thr Leu Asn Gly
1 5 10 15

Val Glu Phe Glu Leu Val Gly Gly Gly Glu Gly Thr Pro Glu Gln Gly
20 25 30

Arg Met Thr Asn Lys Met Lys Ser Thr Lys Gly Ala Leu Thr Phe Ser
35 40 45

Pro Tyr Leu Leu Ser His Val Met Gly Trp Gly Phe Tyr His Phe Gly
50 55 60

Thr Tyr Pro Ser Gly Tyr Glu Asn Pro Phe Leu His Ala Ile Asn Asn
65 70 75 80

Gly Gly Tyr Thr Asn Thr Arg Ile Glu Lys Tyr Glu Asp Gly Gly Val
85 90 95

Leu His Val Ser Phe Ser Tyr Arg Tyr Glu Ala Gly Arg Val Ile Gly
100 105 110

Asp Phe Lys Val Val Gly Thr Gly Phe Pro Glu Asp Ser Val Ile Phe
115 120 125

Thr Asp Lys Ile Ile Arg Ser Asn Ala Thr Val Glu His Leu Arg Pro
130 135 140

Met Gly Asp Asn Val Leu Val Gly Ser Phe Ala Arg Thr Phe Ser Leu
145 150 155 160

Arg Asp Gly Gly Tyr Tyr Ser Phe Val Val Asp Ser His Met His Phe
165 170 175

Lys Ser Ala Ile His Pro Ser Ile Leu Gln Asn Gly Gly Pro Met Phe
180 185 190

Ala Phe Arg Arg Val Glu Glu Leu His Ser Asn Thr Glu Leu Gly Ile
195 200 205

Val Glu Tyr Gln His Ala Phe Lys Thr Pro Thr Ala Phe Ala
210 215 220

<210> 23 
<211> 690 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> nucleic acid sequence for the CopGFP-NA1 variant
<400> 23

arggagagcg acgagagcgg ccrgcccgcc atggagarcg agrgccgcat caccggcacc 60
ctgaacggcg tggagttcga gctggtgggc ggcggagagg gcacccccga gcagggccgc 120
atgaccaaca agatgaagag caccaagggc gccctgacct tcagccccta cctgctgagc 180
cacgtgargg gctacggcxc ciaccacttc ggcacctacc ccagcggcta cgagaacccc 240
trcctgcacg ccatcaacaa cggcggctac accaacaccc gcatcg&gea gtacgaggac 300
ggcggcgtgc tgcacgtgag cttcagctac cgctacgagg ccggccgcgt gatcggcgac 360
ttcaaggtgg tgggcaccgg cttccccgag gacagcgtga tcttcaccga caagatcatc 420
cgcagcaacg ccaccgtgga gcacctgcac cccatgggcg ataacgtgct ggtgggcagc 480
ttcgcccgca ccttcagcct gcgcgacggc ggctactaca gcttcgtggt ggacagccac 540
atgcacttca agagcgccat ccaccccagc atcctgcaga acgggggccc catgttcgcc 600
irccgccgcg tggaggagct gcacagcaac accgagctgg gcatcgtgga gtaccagcac 660
gccttcaaga ccccgatcgc attcgcctga 690



<210> 24 

<211> 229

<212> PRT

<213> Artificial sequence
<220>

<223> amino acid sequence for the CopGFP-NA1 variant

<400> 24

Met Glu Ser Asp Glu Ser Gly Leu Pro Ala Met Glu Ile Glu Cys Arg
1 5 10 15

Ile Thr Gly Thr Leu Asn Gly Val Glu Phe Glu Leu Val Gly Gly Gly
20 25 30

Glu Gly Thr Pro Glu Gln Gly Arg Met Thr Asn Lys Met Lys Ser Thr
35 40 45

Lys Gly Ala Leu Thr Phe Ser Pro Tyr Leu Leu Ser His Val Met Gly
50 55 60

Tyr Gly Phe Tyr His Phe Gly Thr Tyr Pro Ser Gly Tyr Glu Asn Pro
65 70 75 80

Phe Leu His Ala Ile Asn Asn Gly Gly Tyr Thr Asn Thr Arg Ile Glu
85 90 95

Lys Tyr Glu Asp Gly Gly Val Leu His Val Ser Phe Ser Tyr Arg Tyr
100 105 110

Glu Ala Gly Arg Val Ile Gly Asp Phe Lys Val Val Gly Thr Gly Phe
115 120 125

Pro Glu Asp Ser Val Ile Phe Thr Asp Lys Ile Ile Arg Ser Asn Ala
130 135 140

Thr Val Glu His Leu His Pro Met Gly Asp Asn Val Leu Val Gly Ser
145 150 155 160

Phe Ala Arg Thr Phe Ser Leu Arg Asp Gly Gly Tyr Tyr Ser Phe Val
165 170 175

Val Asp Ser His Met His Phe Lys Ser Ala Ile His Pro Ser Ile Leu
180 185 190

Gln Asn Gly Gly Pro Met Phe Ala Phe Arg Arg Val Glu Glu Leu His
195 200 205

Ser Asn Thr Glu Leu Gly Ile Val Glu Tyr Gln His Ala Phe Lys Thr
210 215 220

Pro Ile Ala Phe Ala
225

<210> 25 

<211> 819 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> nucleic acid sequence for the CopGFP-NA2 variant 

<400> 25 



argcccgcca tgaagatcga gtgccgcarc accggcaccc tgaacggcgt ggagttcgag 60

ctggcgggcg goggagaggg cacccccgag cagggccgca tgaccaacaa gatgaagagc 120

accaagggcg ccctgacctt cagcccctac ctgctgagcc acgtgatggg ctacggcttc 180

taccacttcg gcacctaccc cagcggc-tac gagaacccct tcctgcacgc catcaacaac 240

ggcggctaca ccaacacccg catcgagaag tacgaggacg gcggcgtgct gcacgtgagc 300

ttcagctacc gctacgaggc cggccgcgtg atcggcgact tcaaggtggt gggcaccggc 360

ttccccgagg acagcgtgat cttcaccgac aagatcatcc gcagcaacgc caccgtggag 420

cacctgcacc ccatgggcga taacgtgctg gtgggcagct tcgcccgcac cttoagcctg 480

cgcgacggcg gctactacag cttcgtggtg gacagccaca tgcacttcaa gagcgccatc 540

caccccagca tcctgcagaa cgggggcccc atgttcgcct tccgccgcgt ggaggagctg 600

cacagcaaca ccgagctggg catcgtggag raccagcacg ccttcaagac cccgatcgca 660

ticgccagat ccagagccca ggccagcaac tccgccgtgg atggcacagc cggaccggga 720

tcggccgcga ctctagatca taatcagcca taccacattt gtagaggttt tacttgcrtt 780

aaaaaacctc ccacacctcc ccctgaacct gaaacataa 819
<210> 26

<211> 272 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> amino acid sequence for the CopGFP-NA2 variant

<400> 26 



Met Pro Ala Met Lys Ile Glu Cys Arg Ile Thr Gly Thr Leu Asn Gly
1 5 10 15

Val Glu Phe Glu Leu Val Gly Gly Gly Glu Gly Thr Pro Glu Gln Gly

20 25 30

Arg Met Thr Asn Lys Met Lys Ser Thr Lys Gly Ala Leu Thr Phe Ser

35 40 45

Pro Tyr Leu Leu Ser His Val Met Gly Tyr Gly Phe Tyr His Phe Gly

50 55 60

Thr Tyr Pro Ser Gly Tyr Glu Asn Pro Phe Leu His Ala Ile Asn Asn
65 70 75 80

Gly Gly Tyr Thr Asn Thr Arg Ile Glu Lys Tyr Glu Asp Gly Gly Val

85 90 95

Leu His Val Ser Phe Ser Tyr Arg Tyr Glu Ala Gly Arg Val Ile Gly

100 105 110

Asp Phe Lys Val Val Gly Thr Gly Phe Pro Glu Asp Ser Val Ile Phe

115 120 125

Thr Asp Lys Ile Ile Arg Ser Asn Ala Thr Val Glu His Leu His Pro

130 135 140

Met Gly Asp Asn Val Leu Val Gly Ser Phe Ala Arg Thr Phe Ser Leu
145 150 155 160

Arg Asp Gly Gly Tyr Tyr Ser Phe Val Val Asp Ser His Met His Phe

165 170 175

Lys Ser Ala Ile His Pro Ser Ile Leu Gln Asn Gly Gly Pro Met Phe

180 185 190

Ala Phe Arg Arg Val Glu Glu Leu His Ser Asn Thr Glu Leu Gly Ile

195 200 205

Val Glu Tyr Gln His Ala Phe Lys Thr Pro Ile Ala Phe Ala Arg Ser

210 215 220

Arg Ala Gln Ala Ser Asn Ser Ala Val Asp Gly Thr Ala Gly Pro Gly
225 230 235 240

Ser Ala Ala Thr Leu Asp His Asn Gln Pro Tyr His Ile Cys Arg Gly

245 250 255 

Phe Thr Cys Phe Lys Lys Pro Pro Thr Pro Pro Pro Glu Pro Glu Thr 
260 265 270 

<210> 27 
<211> 840
<212> DNA 

<213> Artificial sequence 
<220> 

<223> nucleic acid sequence for the CopGFP-NA3 variant 
<400> 27 

atggagagcg acgagagcgg cctgcccgcc atggagatcg agtgccgcat caccggcacc 60

ctgaacggcg tggagttcga gctggtgggc ggcggagagg gcacccccga gcagggccgc 120

atgaccaaca agatgaagag caccaagggc gccctgacct tcagccccta cctgctgagc 180

cacgtgatgg gctacggctt ctaccacttc ggcacctacc ccagcggcta cgagaacccc 240

ttcctgcacg ccatcaacaa cggcggctac accaacaccc gcatcgagaa gtacgaggac 300

ggcggcgtgc tgcacgtgag cttcagctac cgctacgagg ccggccgcgt gatcggcgac 360

ttcaaggtgg tgggcaccgg cttccccgag gacagcgtga tcttcaccga caagatcatc 420

cgcagcaacg ccaccgtgga gcacctgcac cccatgggcg ataacgtgct ggtgggcagc 480

ttcgcccgca ccttcagcct gcgcgacggc ggctactaca gcttcgtggt ggacagccac 540

atgcacttca agagcgccat ccaccccagc atcctgcaga acgggggccc catgttcgcc 600






ttccgccgcg tggaggagct gcacagcaac accgagctgg gcatcgtgga gtaccagcac 660


gccttcaaga ccccgatcgc attcgccaga tccagagccc aggccagcaa ctccgccgtg 720

gatggcacag ccggaccggg atcggccgcg actctagatc ataatcagcc ataccacatt 780

tgtagaggtt ttacttgctt taaaaaacct cccacacctc cccctgaacc tgaaacataa 840



<210> 28 

<211> 279 

<212> PRT 

<213> Artificial sequence
<220>

<223> amino acid sequence for the CopGFP-NA3 variant

<400> 28 



Met Glu Ser Asp Glu Ser Gly Leu Pro Ala Met Glu Ile Glu Cys Arg

1 5 10 15

Ile Thr Gly Thr Leu Asn Gly Val Glu Phe Glu Leu Val Gly Gly Gly

20 25 30

Glu Gly Thr Pro Glu Gln Gly Arg Met Thr Asn Lys Met Lys Ser Thr

35 40 45

Lys Gly Ala Leu Thr Phe Ser Pro Tyr Leu Leu Ser His Val Met Gly

50 55 60

Tyr Gly Phe Tyr His Phe Gly Thr Tyr Pro Ser Gly Tyr Glu Asn Pro
65 70 75 80

Phe Leu His Ala Ile Asn Asn Gly Gly Tyr Thr Asn Thr Arg Ile Glu

85 90 95

Lys Tyr Glu Asp Gly Gly Val Leu His Val Ser Phe Ser Tyr Arg Tyr

100 105 110

Glu Ala Gly Arg Val Ile Gly Asp Phe Lys Val Val Gly Thr Gly Phe

115 120 125

Pro Glu Asp Ser Val Ile Phe Thr Asp Lys Ile Ile Arg Ser Asn Ala
130 135 140

Thr Val Glu His Leu His Pro Met Gly Asp Asn Val Leu Val Gly Ser
145 150 155 160

Phe Ala Arg Thr Phe Ser Leu Arg Asp Gly Gly Tyr Tyr Ser Phe Val

165 170 175

Val Asp Ser His Met His Phe Lys Ser Ala Ile His Pro Ser Ile Leu

180 185 190

Gln Asn Gly Gly Pro Met Phe Ala Phe Arg Arg Val Glu Glu Leu His

195 200 205

Ser Asn Thr Glu Leu Gly Ile Val Glu Tyr Gln His Ala Phe Lys Thr

210 215 220

Pro Ile Ala Phe Ala Arg Ser Arg Ala Gln Ala Ser Asn Ser Ala Val
225 230 235 240

Asp Gly Thr Ala Gly Pro Gly Ser Ala Ala Thr Leu Asp His Asn Gln

245 250 255

Pro Tyr His Ile Cys Arg Gly Phe Thr Cys Phe Lys Lys Pro Pro Thr

260 265 270

Pro Pro Pro Glu Pro Glu Thr
275



<210> 29 

<211> 238 

<212> PRT 

<213> Aequorea victoria



<400> 29 



Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val

1 5 10 15

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu

20 25 30

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys

35 40 45

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe

50 55 60

Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln
65 70 75 80

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg

85 90 95

Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val

100 105 110

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile

115 120 125

Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn

130 135 140

Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly
145 150 155 160

Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val

165 170 175

Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro

180 185 190

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser

195 200 205

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val

210 215 220

Thr Ala Ala Gly Ile Thr His Gly Met Asp Glu Leu Tyr Lys
225 230 235



<210> 30 

<211> 225 

<212> PRT

<213> Discosoma sp.



<400> 30 



Met Arg Ser Ser Lys Asn Val Ile Lys Glu Phe Met Arg Phe Lys Val

1 5 10 15

Arg Met Glu Gly Thr Val Asn Gly His Glu Phe Glu Ile Glu Gly Glu

20 25 30

Gly Glu Gly Arg Pro Tyr Glu Gly His Asn Thr Val Lys Leu Lys Val

35 40 45

Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp Ile Leu Ser Pro Gln

50 55 60

Phe Gln Tyr Gly Ser Lys Val Tyr Val Lys His Pro Ala Asp Ile Pro
65 70 75 80

Asp Tyr Lys Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val

85 90 95

Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Ser Ser

100 105 110

Leu Gln Asp Gly Cys Phe Ile Tyr Lys Val Lys Phe Ile Gly Val Asn

115 120 125

Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp Glu
130 135 140

Ala Ser Thr Glu Arg Leu Tyr Pro Arg Asp Gly Val Leu Lys Gly Glu
145 150 155 160

Ile His Lys Ala Leu Lys Leu Lys Asp Gly Gly His Tyr Leu Val Glu

165 170 175

Phe Lys Ser Ile Tyr Met Ala Lys Lys Pro Val Gln Leu Pro Gly Tyr

180 185 190

Tyr Tyr Val Asp Ser Lys Leu Asp Ile Thr Ser His Asn Glu Asp Tyr

195 200 205

Thr Ile Val Glu Gln Tyr Glu Arg Thr Glu Gly Arg His His Leu Phe
210 215 220

Leu
225






